Audio-to-score alignment of piano music using RNN-based automatic music transcription

نویسندگان

  • Taegyun Kwon
  • Dasaem Jeong
  • Juhan Nam
چکیده

We propose a framework for audio-to-score alignment on piano performance that employs automatic music transcription (AMT) using neural networks. Even though the AMT result may contain some errors, the note prediction output can be regarded as a learned feature representation that is directly comparable to MIDI note or chroma representation. To this end, we employ two recurrent neural networks that work as the AMT-based feature extractors to the alignment algorithm. One predicts the presence of 88 notes or 12 chroma in frame-level and the other detects note onsets in 12 chroma. We combine the two types of learned features for the audio-to-score alignment. For comparability, we apply dynamic time warping as an alignment algorithm without any additional post-processing. We evaluate the proposed framework on the MAPS dataset and compare it to previous work. The result shows that the alignment framework with the learned features significantly improves the accuracy, achieving less than 10 ms in mean onset error.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic Music Transcription based on Non-Negative Matrix Factorization

In this paper, we present a method for the automatic transcription of polyphonic piano music. The input to this method consists in piano music recordings stored in WAV files, while the pitch of all the notes in the corresponding score forms the output. This method operates on a frame-by-frame basis and exploits a suitable time-frequency representation of the audio signal. The solution proposed ...

متن کامل

Short-Term Memory and Event Memory Classification Systems for Automatic Polyphonic Music Transcription

Music transcription consists in transforming the musical content of audio data into a symbolic representation. The objective of this study is to investigate a transcription system for polyphonic piano. The input to this system consists in piano music recordings stored in WAV files, while the pitch of all the notes in the corresponding score forms the output. The proposed method focuses on tempo...

متن کامل

Static and Dynamic Classification Methods for Polyphonic Transcription of Piano Pieces in Different Musical Styles

In this paper, we present two methods based on neural networks for the automatic transcription of polyphonic piano music. The input to these methods consists in piano music recordings stored in WAV files, while the pitch of all the notes in the corresponding score forms the output. The aim of this work is to compare the accuracy achieved using a feedforward neural network, such as the MLP (Mult...

متن کامل

Clavision: visual automatic piano music transcription

One important problem in Music Information Retrieval is Automatic Music Transcription, which is an automated conversion process from played music to a symbolic notation such as sheet music. Since the accuracy of previous audiobased transcription systems is not satisfactory, we propose an innovative visual-based automatic music transcription system named claVision to perform piano music transcri...

متن کامل

Improvement of an Online Music Alignment Based on Onset Information

Music alignment is the association of events in a musical score with points in the time frames of an audio signal. So it is also called audio-score alignment. Until now, most polyphonic audio-score alignment methods are offline algorithms, and alignment accuracy drops significantly when using online algorithms. But most applications based on alignment need real-time results, such as Automatical...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1711.04480  شماره 

صفحات  -

تاریخ انتشار 2017